Doctoral Thesis Proposal: Automatic Detection and Classification of Prosodic Events
Abstract
Speech prosody is a valuable carrier of information. Accents and phrase boundaries have been shown to contribute to syntactic disambiguation, semantic, pragmatic and paralinguistic interpretation, and to convey information about topicality, focus, contrast and information status. This thesis will present and evaluate techniques to detect and classify these prosodic events. The acoustic correlates of accents, phrase boundaries and phrase-final tones will also be examined. Spoken language processing systems have not made widespread use of prosodic information. We hypothesize that access to this information should improve the performance of many SLP applications. To support this, we will present proof-of-concept examples integrating hypothesized prosodic event information into speech synthesis, story segmentation, extractive summarization, and prosody tutoring applications.
Similar resources
Automatic Detection and Classification of Prosodic Events
Andrew Rosenberg. Prosody, or intonation, is a critically important component of spoken communication. The automatic extraction of prosodic information is necessary for machines to process speech with human levels of proficiency. In this thesis we describe work on the automatic detection and classification of prosodic events – specificall...
Studies on Bird Vocalization Detection and Classification of Species
Aalto University, P.O. Box 11000, FI-00076 Aalto www.aalto.fi Author Seppo Fagerlund Name of the doctoral dissertation Studies on Bird Vocalization Detection and Classification of Species Publisher School of Electrical Engineering Unit Department of Signal Processing and Acoustics Series Aalto University publication series DOCTORAL DISSERTATIONS 166/2014 Manuscript submitted 12 June 2014 Date o...
Automatic punctuation and disfluency detection in multi-party meetings using prosodic and lexical cues
We investigate automatic approaches to finding “hidden” spontaneous speech events, such as sentence boundaries and disfluencies, in multi-party meetings. Hidden events are characterized prosodically by a large array of automatically extracted energy, duration, and pitch features, and are modeled by decision tree classifiers; lexical cues are modeled by N-gram language models. Both sources of in...
Automatic road crack detection and classification using image processing techniques, machine learning and integrated models in urban areas: A novel image binarization technique
The quality of the road pavement has always been one of the major concerns for governments around the world. Cracks in the asphalt are one of the most common road tensions that generally threaten the safety of roads and highways. In recent years, automated inspection methods such as image and video processing have been considered due to the high cost and error of manual metho...
Automatic Punctuation and Disfluency Meetings Using Prosodic An
We investigate automatic approaches to finding “hidden” spontaneous speech events, such as sentence boundaries and disfluencies, in multi-party meetings. Hidden events are characterized prosodically by a large array of automatically extracted energy, duration, and pitch features, and are modeled by decision tree classifiers; lexical cues are modeled by N-gram language models. Both sources of in...